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The  purpose  of  this  research  was  to  compare  a two-dimensional 
polygon  discrimination  learning  (DL)  test  designed  by  Arima  and 
Young  with  the  Armed  Services  Vocational  Aptitude  Battery  (ASVAB) 
currently  being  used  for  military  selection. 

A sample  of  65  high  school  students  who  had  taken  the  ASVAB 
in  the  military's  High  School  Testing  Program  were  given  the  DL 
test  during  the  first  two  months  of  1979.  Little  correlation 
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as  the  dependent  variable  indicated  that  the  subtest  General 
Information  (GI)  predicted  Information  Processing  Rate  (IPR) 
score  to  some,  extent.  For  whites  only,  the  Armed  Forces 
Qualification  Test  (AFQT)  also  served  to  predict  IPR  score. 
Results  indicate  that  the  DL  test  measures  some  other  aspect 
of  ability  other  than  that  demonstrated  on  the  ASVAB  and  may 
therefore  be  of  use  in  recruit  screening. 
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ABSTRACT 


The  purpose  of  this  research  was  to  compare  a two-di- 
mensional polygon  discrimination  learning  (DL)  test  designed 
by  Arima  and  Young  with  the  Armed  Services  Vocational  Aptitude 
Battery  (ASVAB)  currently  being  used  for  military  selection. 

A sample  of  6 5 high  school  students  who  had  taken  the 
ASVAB  in  the  military's  High  School  Testing  Program  were 
given  the  DL  test  during  the  first  two  months  of  1979.  Little 
correlation  was  found  between  the  two  tests.  Multiple  re- 
gression with  IPR  as  the  dependent  variable  indicated  that 
the  subtest  General  Information  (GI)  predicted  Information 
Processing  Rate  (IPR)  score  to  some  extent.  For  whites  only, 
the  Armed  Forces  Qualification  Test  (AFQT)  also  served  to 
predict  IPR  score.  Results  indicate  that  the  DL  test  measures 
some  other  aspect  of  ability  other  than  that  demonstrated 
on  the  ASVAB  and  may  therefore  be  of  use  in  recruit 
screening. 
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I.  INTRODUCTION 

The  advent  of  the  all-volunteer  force  in  1973  dictated 
many  changes  in  personnel  management  procedures  and  conditions 
within  the  military.  The  largest,  most  immediate  effect  was 
that  there  no  longer  existed  a massive  pool  from  which  to 
pick  individuals  meeting  high,  rather  arbitrarily  defined 
standards  of  mental  and  physical  ability.  In  a very  short 
time,  the  military  services  have  had  to  shift  from  the  luxury 
of  administratively  selecting  from  an  unlimited  source  to 
the  requirement  to  first  create  a source  through  more  pro- 
nounced recruiting  and  then  to  select  a slightly  smaller 
number  of  entrants  from  this  much  smaller  pool. 

The  basic  personnel  procurement  problem  facing  the  military 
is  that  it  has  been  able  only  with  great  effort  to  meet  the 
numerical  requirements  for  new  recruits.  The  present  Chief 
of  Naval  Operations,  Admiral  Hayward,  recently  estimated  the 
petty  officer  shortage  in  the  Navy  to  be  reaching  30,000.  He 
also  stated  that  this  shortfall  could  possibly  result  in 
unfilled  committments  (Sinaiko,  1978).  There  are  myriad 
reasons  offered  for  manning  shortfalls  ranging  from  the  pass- 
ing of  the  baby  boom  to  the  increasing  unattractiveness  of 
military  work  (Cooper,  1978j  Defense  Manpower  Commission, 

1976) . 

There  are  numerous  ways  which  present  and  projected  short- 
falls can  be  combatted.  Retaining  more  present  members  of 
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the  armed  services  is  perhaps  one  of  the  best  as  well  as 
one  of  the  most  difficult.  Allowing  more  women  into  the 
service  is  another  obvious  means.  Mental,  physical,  and 
moral  guidelines  could  be  relaxed  and  immediately  alleviate 
recruiting  problems.  There  is  an  ever-increasing  list  of 
alternatives  for  manning  the  armed  forces,  each  with  a 
benefit  and  each  with  a price. 

There  is  one  alternative,  however,  which  is  less  generally 
considered.  That  is  to  improve  our  present  selection  pro- 
cedures by  devising  more  effective  mental- testing  programs. 

The  purpose  of  the  following  research  was  to  study  such  a 
possible  improvement  in  the  services'  enlistment  testing. 

Traditionally  the  measure  of  an  individual’s  mental 
ability  has  been  taken  through  the  use  of  various  paper-and- 
pencil,  verbally  oriented  tests.  The  present  forms  of  the 
Armed  Services  Vocational  Aptitude  Battery  (ASVAB)  is  one  such 
paper-and-pencil  test.  This  battery,  when  used  alone  to 
screen  applicants  for  mental  ability,  suffers  by  its  design 
limitations.  Estes  (197*0  points  out  that  an  individual's 
successful  passing  of  a verbal-type  test  provides  useful  in- 
formation as  to  the  extent  to  which  the  various  prerequisites 
for  successful  performance  have  been  simultaneously  satisfied 
by  the  individual's  combination  of  inherent  capacities  and 
past  experiences.  But  failure  gives  little  information 
because  there  is  no  indication  as  to  why  the  individual  failed. 
There  are,  unfortunately,  many  more  reasons  other  than  limited 
ability. 
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Individuals  can  possess  high  mental  capability  and  still 
do  poorly  when  tested  in  the  traditional  manner  if  they  have 

1 

not  received  the  background  and  training  necessary  (and 
assumed  by  the  examiner) . Words  used  in  the  test  may  be 
strange;  the  subject  matter  may  be  foreign;  the  general  for- 
mat of  the  test  may  be  new;  an  individual  may  give  an  answer 
which  is  meaningful  in  his  own  frame  of  reference  but  not 
within  that  of  the  examiner.  Two  people  with  equal  mental 
potential  can  make  impressively  different  scores  on  a test 
of  mental  ability  if  only  one  has  been  "taught"  to  take  that 
type  of  test.  Thus  present  tests  would  seem  to  be  unfair  to 
the  bright  but  unorthodox  person,  to  the  culturally  disad- 
vantaged, and  to  the  naive  individual  who  lacks  experience 
in  taking  standardized  tests  (Holtzman,  1971) • 

Since  the  ASVAB  is  one  such  standardized  test,  and  since 
it  is  the  sole  means  of  identifying  those  "mentally  qualified" 
for  duty,  it  is  quite  conceivable  that  the  armed  forces  are 
turning  away  a substantial  group  of  otherwise  qualified  young 
men  and  women  simply  because  of  the  format  of  the  test. 

Since,  also,  the  culturally  disadvantaged  and  the  inexperienced 
test  talkers  tend  to  be  nonwhite  in  this  country,  equal  oppor- 
tunity requirements  are  perhaps  not  being  completely  satisfied. 

The  Equal  Employment  Opportunity  Commission  (EEOC) , es- 
tablished to  enforce  the  Civil  Rights  Act  of  1964,  has  looked 
fairly  critically  at  employment  testing  in  recent  years.  The 
importamt  relevant  point  in  the  commission's  guidelines  is 

10 


r 


that  employment  tests  should  be  used  only  when  there  is 
demonstrated  evidence  of  their  validity  for  the  purpose,  and 
that  tests  must  not  discriminate  unfairly  against  minority 
groups  (Federal  Register,  Vol.  36,  1971).  Much  progress  in 
both  mental  measurement  and  validation  procedures  must  be 
realized  before  these  guidelines  can  be  satisfied.  Minori- 
ties still  score  lower  on  service  entrance  examinations  than 
whites  although  it  has  yet  to  be  shewn  that  nonwhites  perform 
less  well  on  the  job  (Lockman,  1976;  Bilinski,  197*0.  It 
would  be  well  for  the  services  to  resolve  this  incongruity  of 
their  own  volition  since  there  will  probably  be  pressure  from 
outside  to  do  so  in  the  future. 

The  research  presented  here  was  undertaken  to  study  one 
alternative  means  of  testing  mental  capabilities  of  potential 
recruits.  The  test  was  originally  designed  by  Arima  (1978) 
as  a test  of  learning  ability.  It  is  a culture-fair  or 
culture-free  approach  in  that  an  attempt  has  been  made  to 
limit  penalties  caused  by  different  cultural  backgrounds. 

It  was  anticipated  that  a test  similar  to  this  culture-fair 
test  could  be  used  to  supplement  the  present  ASVAB  causing 
the  new  total  test  battery  to  be  less  discriminatory  against 
culturally  different  backgrounds. 


II.  PREVIOUS  CULTURE-FREE  TEST  ATTEMPTS 


I I 


There  have  been  numerous  attempts  at  devising  culture- 
free  devices  for  testing  individuals  and  predicting  various 
qualities  or  outcomes.  Although  the  approaches  have  been 
quite  different,  the  goal  has  been  similar!  to  arrive  at  some 
measure  of  a person's  abilities  in  such  a way  as  to  not  allow 
that  person's  cultural  situation  to  be  a deciding  factor  in 
the  outcome. 

The  Porteus  Maze  Test  and  the  Navy  Maze  Test  (Cory,  1971) 
attempted  to  measure  learning  ability,  especially  in  mental 
group  IV  personnel.  The  purpose  was  to  help  select  a portion 
of  those  who  had  failed  standard  service  tests  to  be  admitted 
into  the  military.  The  testee  had  to  trace  his  way  out  of  or 
into  different  mazes  and  was  graded  on  various  procedures 
and  responses  as  well  as  successful  completion. 

Glickman,  et  al.  (1971)  examined  many  tests  attempting 
culture-fair  prediction  of  performance  in  such  widely  varying 
groups  as  taxi  drivers,  college  students,  insurance  salesmen, 
and  sewing  machine  workers.  Training  simulators,  psychomotor 
tests,  opinion/personality/interest  questionnaires,  and  work 
sampling  were  investigated.  Unfortunately,  it  was  found  that 
even  these  less  traditional  devices  had  either  low  predictive 
validity  or  were  biased  toward  racial  groups. 

Siegel  and  Leahy  (197*0  describe  an  attempt  to  predict 
performance  in  the  fleet  by  a type  of  job  sampling.  They 
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were  interested  in  measuring  learning  ability  on  small  tasks 
related  to  the  machinists  mate  rating.  Their  work  produced 
quite  encouraging  results,  however  the  technique  would  prove 
hard  to  use  with  "mass  production"  selection  processes.  The 
time  required  to  screen  one  individual  would  be  prohibitive. 

Of  significance  is  the  fact  that  they  proved  that  a test  of 
learning  ability  can  be  used  effectively  as  a nonbiased  pre- 
dictor of  job  performance. 

Cory,  et  al.  (1973)  reported  an  interesting  attempt  at 
testing  recruits  undergoing  basic  training.  The  recruits 
were  tested  on  their  ability  to  perform  tasks  they  had  recent- 
ly learned.  The  test  scores  were  then  correlated  with  GCT, 

ARI,  and  AFQT  scores.  The  experiment  was  discontinued  because 
it  was  determined  that  it  was  not  possible  to  get  enough  of 
a distribution  in  scores  on  the  task  performance  test.  What 
was  of  interest  was  that  there  was  low  correlation  between 
the  Recruit  Training  Test  and  the  more  standard  GCT/ARI  and 
AFQT,  suggesting  that  the  two  types  of  tests  measured  dif- 
ferent abilities. 

Tests  which  measure  ability  to  learn  such  as  those  of 
Siegel  and  Leahy  and  Cory  seem  to  hold  the  most  promise.  Un- 
fortunately they  suffer  from  a time  and  equipment  intensiveness 
which  limits  efficient  use  on  a large  scale.  What  is  needed 
is  a measure  of  learning  ability  which  could  be  taken  in  a 
short  period  of  time  to  be  compatible  with  present  Armed  Forces 
Examining  and  Entrance  Station  (AFEES)  procedures  and  time 
constraints . 
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Young  (1975)  and  Arima  (1978)  described  a nonverbal  dis- 
crimination learning  (DL)  test1  they  constructed  and  admin- 
istered to  a group  of  recruits  at  the  Naval  Training  Center 
(NTC) , San  Diego,  California.  It  was  decided  that  a non- 
verbal format  would  reduce  the  effect  of  previous  educational 
experiences  and  cultural  background.  The  specific  shapes  came 
from  work  done  by  Arnoult  (195^>  1956)  and  are  shown  in 
Figure  1 . These  shapes  were  formed  from  computer  generated 
random  numbers  indicating  points  which  were  then  connected  by 
lines.  Shapes  of  this  type  were  preferable  to  profiles  of 
objects  because  of  the  culture-free  goal. 

Discrimination  learning  (DL)  was  chosen  as  the  format 
because  of  its  possible  potential  for  measuring  general 
learning  ability  as  opposed  to  some  more  specific  ability  as 
would  be  measured  in  the  more  common  employment  testing  pat- 
terns. Work  sampling,  questionnaires,  psychomotor  tests  and 
the  like  could  be  expected  to  have  somewhat  higher  validity 
within  one  specific  occupation.  What  is  needed,  though,  is 
a test  which  could  measure  a more  universal  quality.  The 
armed  forces  have  too  many,  quite  different,  job  requirements 

*DL  is,  simply,  learning  to  discriminate  a "correct"  choice 
from  a group  of  two  or  more  items.  It  is  usually  done  by  re- 
peated trials  and  errors  with  some  reinforcement  procedure 
used  when  the  desired  choice  is  made.  As  more  and  more 
"correct"  choices  are  made,  learning  can  be  shown  to  be 
taking  place. 


i 


1 l i 4 b b 

k * * 


10  11  12 


13  14  15  16  17  18 


19  20  21  22^  » - 

v A k-  ^ 

25  26  27  * 2 8 . 29  30 


23  24 


Figure  1.  Shapes  selected  for  use  in  assembling 
stimulus  lists. 

(From  Arnoult,  1956) 
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to  benefit  from  excessively  specific  employment  aptitude 
tests  for  selection  purposes. 

The  DL  test  has  a final  qualification  which  is  useful  in 
the  end  requirement  of  screening  potential  recruits  for  the 
armed  forces.  A DL  type  of  test  can  be  administered  in  a 
very  short  period  of  time.  This  characteristic  becomes  quite 
important  when  one  considers  that  prospective  recruits  have 
already  a fairly  full  schedule  of  mental  and  physical  exam- 
inations during  their  one  day  at  the  AFEES  (Navy  Recruiting 
Manual,  1978). 

The  procedures  used  for  test  construction  and  preliminary 
experiments  are  well  described  by  Young  (1975)  and  Arima  (1978) 
and  will  not  be  detailed  here.  The  tests  they  devised  were 
administered  to  160  male  U.S.  Navy  recruits  at  NTC,  San 
Diego  in  order  to  evaluate  the  characteristics  of  the  test 
under  conditions  as  close  to  operational  as  possible  and  to 
investigate  the  appropriateness  of  the  various  test  parameters 
such  as  scoring  technique,  composition,  and  presentation 
methods.  Among  the  results  attained  were  the  followings 
(Young,  1975?  Arima,  1978) 

1.  Learning  did  take  place  during  the  test. 

2.  A self-paced  mode  of  test  administration  appeared 
to  give  more  useful  information  than  a machine-paced 
mode.  It  did  not  restrict  a superior  performer. 

3.  No  significant  difference  was  seen  between  white  and 
nonwhite  performance  on  the  self-paced  test. 


4.  Performance  did  not  correlate  highly  with  performance 
on  standard  verbal-oriented  intelligence  tests  (at 
that  time,  the  General  Classification  Test  (GCT) . 

The  research  which  follows  was  designed  to  begin  where 


this  previous  work  left  off. 
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IV.  EXPERIMENTAL  GOAL 


In  the  time  since  recruits  from  the  Naval  Training  Center 
were  administered  the  Arima-Young  test  there  has  been  a major 
change  in  the  test  batteries  used  for  selection.  At  that 
time  potential  enlistees  were  given  the  Armed  Forces  Quali- 
fication Test  (AFQT)  and  some  sort  of  supplemental  test  bat- 
tery which  varied  in  design  and  use  from  service  to  service. 

Since  then,  all  services  have  adopted  a uniform  testing 
program  for  selection  and  classification  of  most  new  recruits. 
Today  each  volunteer  is  given  the  Armed  Services  Vocational 
Aptitude  Battery  (ASVAB)  which  consists  of  twelve  subtests, 
described  in  Table  1.  Sone  special  selection  tests  are  still 
administered  for  speciality  areas  — nuclear  power  ratings  in 
the  Navy,  for  example.  Scores  from  three  of  the  ASVAB  sub- 
tests — Work  Knowledge,  Arithmetic  Reasoning,  and  Space 
Perception  — are  combined  into  a general  scale  of  mental 
ability.  This  combined  score  is  known  as  the  AFQT,  a name 
held  over  from  when  there  was  a dedicated  test  by  that  name 
as  mentioned  above.  The  new  AFQT  score  is  used  to  separate 
volunteers  into  various  mental  groups,  and  it  is  the  primary 
means  of  determining  mental  eligibility  for  military  service. 
Scores  from  all  subtests  are  used  for  classification  purposes 
once  basic  mental  eligibility  has  been  established. 

The  purpose  of  the  research  described  here  was  to  compare 
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determine  if  their  approach  to  culture-free  testing  measured 
some  different  capability  than  that  measured  by  the  A3VA3. 

In  order  for  this  new  type  of  test  to  be  useful,  it  must  be 
capable  of  identifying  some  new  group  of  individuals  whc  na  '• 
the  potential  to  succeed  in  military  service. 


TABLE  1 


Subtests  of  the  ASVAB  Form  5 


Name  of  Test 

Number 
of  Items 

Testing  Time 
in  Minutes 

(GI)  General  Information 

15 

07 

(NO)  Numerical  Operations 

50 

03 

(AD)  Attention  to  Detail 

30 

05 

(WK)  Word  Knowledge 

30 

10 

(AR)  Arithmetic  Reasoning 

20 

20 

(SP)  Space  Perception 

20 

12 

(MC)  Mathematical  Knowledge 

20 

20 

(El)  Electronic  Information 

30 

15 

(MC)  Mechanical  Comprehension 

20 

15 

(GS)  General  Science 

20 

10 

(SI)  Shop  Information 

20 

08 

(AI)  Automotive  Information 

20 

10 

f 


V.  TEST  DESCRIPTION 

A.  TEST  CONSTRUCTION 

Six  pairs  of  figures  were  used,  selected  from  those  shown 
in  Figure  1.  They  were  the  same  as  Arima  and  Young's  Stimulus 
List  1 which  was  used  in  the  self-paced  phase  of  their  experi- 
ment. This  list  contained  figures  previously  determined  to 
have  the  least  similarity  between  figures  in  a pair  and  also 
the  least  similarity  between  pairs  (Young,  1975) • Figure  2 
depicts  the  figures  used. 

One  figure  from  each  pair  was  choosen  to  be  the  correct 
response.  The  role  of  a die  was  used  to  make  this  assignment 
which  resulted  in  different  correct  figures  than  on  the  Arima- 
Young  test.  Order  of  presentation  of  the  pairs  was  the  same 
for  both  tests. 

The  test  was  designed  so  that  one  pair  of  figures  at  a 
time  was  revealed.  The  subject  chose  which  figures  he  or  she 
felt  was  the  right  response  (guessing  at  first)  by  pressing 
a clear  panel  covering  that  figure.  If  the  correct  choice 
was  made,  the  test  apparatus  would  cycle  to  the  next  pair. 

If  the  choice  was  incorrect,  nothing  would  happen  until  the 
subject  chose  the  other  alternative.  The  above  correctional 
procedure  was  the  only  reinforcement  used.  The  six  pairs 
were  each  shown  ten  times  in  random  order  for  a total  of  60 
frames.  Table  2 shows  the  order  of  pair  presentation.  The 
entire  test  was  self-paced. 
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Figure  2.  Test  Figures 
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TABLE  2 


Order  of  Stimulus  Set  Presentation 


Repetition 


Figure  Order 


2; 

If 

55 

6 ? 

3 5 

4 

1; 

2s 

6 j 

4; 

3s 

5 

6 ; 

2s 

5s 

3s 

1 ; 

4 

2; 

35 

6; 

4; 

1 j 

5 

4; 

55 

2s 

35 

6; 

1 

5s 

2s 

6 j 

4; 

3s 

1 

4; 

5 5 

6 j 

35 

2s 

1 

3s 

6 ; 

1; 

2s 

5s 

4 

6 ; 

1; 

2s 

4; 

5s 

3 

2; 

4; 

5s 

35 

1 s 

6 

Note . Figure  numbers  refer  to  those  shown  in  Figure  1 
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Other  constraints  used  in  the  construction  of  the  test 
were « 

1.  The  same  two  figures  were  always  displayed  together. 

2.  The  left-right  order  was  changed  randomly  with  the  con- 
straint that  each  figure  appeared  on  the  left  and  right 
an  equal  number  of  times. 

3.  Order  within  each  series  of  six  was  varied. 

4.  All  six  pairs  were  presented  before  one  was  repeated. 

5.  The  same  pair  was  never  presented  back-to-back. 

6.  All  figures  retained  the  same  "upright"  orientation  — 
i.e.f  they  were  never  turned  around. 

B . TEST  HARDWARE 

The  equipment  used,  shown  in  Figure  3»  was  capable  of 
presenting  one  pair  of  figures  at  a time  to  the  subject,  de- 
termining correct  and  incorrect  responses,  providing  total 
figure  exposure  time,  and  counting  total  correct  and  incorrect 
responses.  All  equipment  was  manufactured  by  Behavior  Con- 
trol Institute  (BCI) . 

The  system  centered  around  the  BCI  Stimulus  Response 
Programmer.  The  programmer  was  set  to  operate  in  the  multiple 
choice  mode  which  was  capable  of  displaying  four  choices 
under  four  clear  panels  (channels  A,  B,  C,  and  D)  on  the  top 
of  the  machine.  A choice  was  made  by  depressing  one  of  these 
panels  which  caused  various  electrical  circuits  to  be  completed 
depending  on  how  the  machine  was  programmed.  Since  only  two 
choices  were  possible  in  the  present  test,  the  panels  corres- 
ponding to  channels  A and  D were  deactivated  and  blocked. 


I 


24 


Machine  programming  was  accomplished  with  the  use  of  the 


same  block  of  fan-folded  paper  that  displayed  each  pair  of 
figures.  As  the  paper  cycled  through  the  machine  exposing 
one  pair  at  a time  and  stopping  until  correct  selection  was 
made,  the  machine  "read"  a code  punched  in  one  side  of  the 
paper  which  indicated  the  correct  response  and  initiated 
movement.  The  machine  stopped  automatically  at  the  end  of 
the  test. 

A BCI  Four  Choice  Auxiliary  Control  Console  acted  as  an 
interface  between  the  programmer  and  a BCI  Scoring  Indicator 
which  cumulatively  tallied  correct  and  incorrect  responses. 

A timer  was  added  to  the  system  and  mounted  on  the  Four  Choice 
Auxiliary  Control  Console  and  electrically  interfaced  with  it 
in  such  a way  that  the  timer  ran  only  during  actual  figure 
exposure  time.  It  would  stop  when  the  paper  was  cycling  to 
a new  pair. 

An  electronic  counter  was  added,  built  into  the  rear  of 
the  Scoring  Indicator.  It  was  designed  to  activate  a buzzer 
when  six  correct  responses  in  a row  were  made  by  a subject. 

It  became  unreliable  during  the  experiment  and  was  not  used. 

C . SCORING 

Scoring  was  done  by  calculating  an  Information  Processing 
Rate  (IPR)  for  each  subject.  Each  frame  was  considered  to 
contain  one  bit  of  information*  the  correct  figure  of  a 
displayed  pair.  A subject  processed  one  bit  of  information 
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when  he  or  she  picked  the  correct  figure.  Therefore  a rate 
of  information  processing  could  be  computed  by  dividing  total 
correct  responses  by  total  exposure  time,  both  of  which  could 
be  read  directly  from  the  test  apparatus.  IPR  results  dis- 
cussed and  displayed  have  been  multiplied  by  1000  in  order  to 
allow  easier  manipulation  and  discussion. 

A second  means  of  scoring  had  initially  been  planned.  It 
was  hypothesized  that  another  way  to  measure  performance  was 
to  record  hew  many  frames  were  presented  before  a subject 
scored  six  correct  responses  in  a row,  indicating  he  or  she 
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had  "learned"  all  six  pieces  of  information  — a trials-to- 
criterion  ueasure.  The  equipment  used  to  signal  when  six 
correct  in  a row  was  achieved  malfunctioned  early  in  the  ad- 
ministration of  the  test  to  subjects,  however,  and  this  second- 
ary method  of  scoring  could  not  be  used.  It  was  discovered 
while  this  scoring  system  still  worked  that  most  subjects 
still  missed  many  responses  subsequent  to  scoring  six  in 
a row  right.  Thus  it  appears  that  a simple  count  of  the 
number  of  frames  required  to  score  six  in  a row  may  have  been 
an  inaccurate  measure  of  learning. 

It  is  apparent  that  no  real  information  processing  took 
place  for  the  first  six  frames  since  the  subject  had  never 
seen  the  pairs  before  and  could  only  guess  which  figure  in 
each  pair  was  correct.  It  was  felt  that  no  advantage  was 


in  the  interest  of  administration  and  scoring  simplicity  re- 
sults of  the  first  six  frames  were  not  and  could  not  be 
separated  from  the  results  on  the  next  5^  frames.  All  scores 
would  have  been  slightly  higher  had  only  the  last  5^  frames 
been  used  to  compute  IPR. 
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VI.  TEST  ADMINISTRATION 


A.  SUBJECTS 

In  order  to  most  accurately  compare  ASVAB  with  the  Arima- 
Young  test  it  was  necessary  to  locate  subjects  who  could  take 
both  tests  prior  to  any  selection  process  for  military  service. 

Only  in  that  way  would  the  lower  spectrum  of  ability  (as 
measured  by  the  ASVAB)  be  represented  for  each  test.  For 
that  reason  it  was  not  possible  to  administer  the  Arima-Young 
test  at  Recruit  Training  Centers.  Only  subjects  who  had  al- 
ready scored  sufficiently  high  on  the  AFQT  would  have  been 

: 

represented.  Even  volunteers  tested  at  the  AFEESs  would  have 
been  preselected  to  some  extent  and  would  have  presented  a 
biased  sample. 

] 

As  part  of  its  recruiting  effort  the  military  administers 
a High  School  Testing  Program.  A form  of  the  ASVAB  is  given 
to  high  school  students  as  a counseling  device  for  school 
counselors.  The  test  is  available  to  all  students.  While 
no  direct  recruiting  takes  place  during  the  testing  process, 
the  test  results  are  later  made  available  to  recruiters,  and 
the  entire  program  has  proven  to  be  a valuable  recruiting 
tool  (Navy  Recruiting  Manual,  1978).  It  was  this  group  of 
high  school  students  which  seemed  to  offer  the  best  medium  for 
comparison  of  the  two  tests. 

Three  schools  on  the  Monterey  Peninsula  cooperated  in 
allowing  this  experiment.  Pacific  Grove  High  School,  Seaside 
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High  School,  and  Alisal  High  School  in  Salinas  made  available 
a total  of  65  students  who  had  taken  the  ASVAB  Form  5 within 
the  past  four  months.  Table  3 describes  the  sample  by  race 
and  sex. 

The  nonwhites  were  represented  most  heavily  by  individu- 
als of  Spanish  descent  (11  total).  Only  one  black  took  the 
test.  The  rest  of  the  nonwhite  group  of  students  is  divided 
as  follows!  Filipino  descent  (2);  Oriental  descent  (4); 

Native  American  descent  (1);  "Other"  backgrounds  (3).  Racial 
group  was  determined  by  self  report  by  each  subject. 

Students  came  from  grades  9 through  12  with  the  average 
grade  being  10.?.  Ages  represented  were  14  through  18, 
averaging  16.2. 

B . PROCEDURES 

Subjects  were  initially  given  the  test  instructions  in 
groups  of  four  or  less.  Instructions  were  read  to  them  con- 
cerning the  type  of  test  they  would  take,  what  was  required, 
how  they  could  determine  a correct  answer,  and  other  character- 
istics of  the  test.  The  test  instructions  appear  in  Appendix 
A.  Subjects  were  then  shown  an  example  of  what  they  would 
encounter  in  the  form  of  a sample  test  consisting  of  only  two 
pairs.  The  figures  used  were  from  Arnoult's  (1956)  collection 
shown  in  Figure  1 and  were  not  the  same  ones  used  in  the 
actual  test.  The  sample  test  served  to  acquaint  the  students 
with  the  mechanics  of  the  test  apparatus  and  to  help  illustrate 
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TABLE  3 


Summary  of  Student  Participation 


SUBJECT 

GROUP 

Seaside 

SCHOOL 

Pacific  Grove 

Alisal 

TOTAL 

SUBJECTS 

WHITE 

(17) 

(24) 

(2) 

(43) 

Male 

7 

10 

0 

17* 

Female 

10 

14 

2 

26  ♦♦ 

NONWHITE 

(5) 

(10) 

(7) 

(22) 

Male 

2 

7 

2 

11* 

Female 

3 

3 

5 

11** 

ALL  SUBJECTS 

22 

34 

9 

65 

♦Total  males  - 28 
♦♦Total  females  - 37 


the  verbal  instructions.  Any  final  questions  the  group  had 
were  answered  during  and  after  the  sample  test. 

Individuals  took  the  actual  test  alone.  Last  minute 
questions  were  answered,  and  the  test  was  started  and  contin- 
ued until  all  10  repetitions  were  presented.  The  total 
figure  exposure  time  ranged  from  35.5  sec.  to  161.1  sec. 

The  mean  exposure  time  was  79.1  seconds.  Including  a 1.4 
second  cycle  time  between  pairs,  the  entire  test  lasted 
2.7  minutes  on  the  average. 


VII.  RESULTS 


Raw  ASVAB  scores  were  obtained  for  each  subject  from  the 
schools  involved  in  order  to  carry  out  the  required  analysis. 

A summary  of  scores  (by  mean  and  standard  deviation)  is  pro- 
vided in  Table  4 for  all  subjects  and  followed  by  Tables  5 
through  7 for  each  school.  For  each  figure  the  number  (N)  in 
each  category  is  followed  by  the  subtest  scores,  the  composite 
AFQT  score  and,  finally,  the  score  (IPR)  from  the  Arima-Young 
test.  A summary  of  individual  IPR  and  AFQT  scores  by  ethnic 
group  is  presented  in  Appendix  B. 

Zero-order,  product-moment  correlation  coefficients  were 
computed  in  order  to  measure  the  strength  of  the  relationship 
between  IPR  and  ASVAB  scores.  The  results  are  shown  in 
Table  8.  The  highest  subtest  correlation  can  be  seen  from 
Table  8 to  be  with  the  General  Information  (GI)  subtest  with 
a correlation  coefficient  of  .3^*  A low  but  positive  cor- 
relation statistically  significant  at  the  .05  level  was  also 
found  for  the  AFQT.  Figure  4 is  a scattergram  of  individual 
IPR  scores  plotted  against  the  AFQT  score. 

Subjects  were  then  divided  into  two  groups  depending  on 
the  AFQT  score.  Those  who  scored  in  the  top  50%  nationally 
were  separated  from  those  who  scored  in  the  lower  50%.  Again 
Pearson  product-moment  correlations  were  computed  to  determine 
if  one  group  or  the  other  had  closer  relations  between  the  IPR 
score  and  the  ASVAB  and  AFQT  scores.  The  results  are  shown  in 


TABLE  4 


MEAN  TEST  SCORES  BY  RACE  AND  SEX  - ALL  SCHOOIS 


WHITE 

NONWHITE 

TOTAL 

male 

female 

total 

male 

female 

total 

male 

female 

total 

N 

17 

26 

43 

11 

11 

22 

28 

37 

65 

GI 

10.29 

1.69 

7.85 

I.83 

8.81 

2.13 

10.09 

2.95 

6.46 

2.12 

8.27 

3.12 

10.21 

2.22 

7.43 

1.99 

8.63 

2.50 

WK 

21.88 

5.08 

17-89 

5.60 

19.46 

5.69 

1706 

7.49 

13-46 

6.79 

15.41 

7.26 

20.11 

6.41 

16.57 

6.23 

18.09 

6.50 

MK 

14.4? 

4.19 

13.15 

4.32 

13.67 

4.27 

10.64 

4.43 

11.73 

5-41 

11.18 

4.86 

12.96 

4.62 

12.73 

4.64 

12.83 

4.59 

GS 

12.06 

4.01 

8.92 

3.03 

10.16 

3.74 

8.91 

2.81 

6.73 

3-04 

7.82 

3.07 

10.82 

3.86 

8.27 

3.16 

9.37 

3.68 

NO 

36.53 

7.98 

36.50 

8.05 

36.51 

7.92 

31.91 

9-75 

36.09 

11.40 

34.00 

10.57 

3^*71 

8.84 

36.38 

9.00 

35.66 

8.90 

AR 

13.47 

4.24 

11.96 

3-18 

12.56 

3-67 

10.64 

4.23 

10.00 

3.19 

10.32 

3.67 

12.36 

4.39 

11.38 

3.27 

11.80 

3-79 

El 

17-24 

5-87 

12.31 

4.10 

14.26 

5-39 

14.64 

4.52 

12.82 

2.82 

13.73 

3.8O 

16.21 

5.45 

12.46 

3-73 

14.08 

4.88 

SI 

12.88 

3-77 

8.92 

2.56 

10.49 

3.63 

11.64 

3-78 

6.91 

2.17 

9.27 

3.86 

12.39 

3-76 

8.32 

2.59 

10.08 

3.72 

AD 

13-71 

3-87 

14.73 

3.09 

14.33 

3-41 

14.64 

3-33 

13.09 

4.78 

13.86 

4.10 

14.07 

3.63 

14.24 

3.69 

14.17 

3.63 

SP 

12.41 

5.41 

IO.58 

3-69 

11.30 

4.48 

8.46 

3-39 

9.00 

4.38 

8.73 

3.83 

10.86 

5.05 

10.11 

3-91 

10.43 

4.42 

MC 

11.94 

3.60 

8.35 

3.05 

9-77 

3.69 

9-82 

2.68 

5.82 

1.17 

7.82 

2.87 

11.11 

308 

7.60 

2.86 

9-11 

3.54 

AI 

9.35 

4.89 

7.15 

3-03 

8.02 

3-97 

8.91 

2.91 

5.82 

2.27 

7.36 

3.00 

9.18 

4.16 

6.76 

2.86 

7.80 

3.66 

AFQT 

47.77 

11.36 

40.42 

9-73 

43.33 

10.89 

36.46 

12.45 

32.46 

12.36 

34.46 

12.28 

43.32 

12.87 

38.05 

11.03 

40.32 

12.05 

IPR 

659.06 

248.80 

648.00  652.37 
265.64  256.15 

586.64 

176.89 

450.00 

152.52 

518.32  630.60 
175.69  222.64 

589.14  607.00 
252.75  239.32 

Note. 

Top  number  is  test 

mean. 

Bottom 

number 

is  test  standard 

deviation. 
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TABLE  5 

MEAN  TEST  SCORES  BY  RACE  AND  SEX  - SEASIDE  HIGH  SCHOOL 


WHITE  NONWHITE  TOTAL 


-male 

female 

total 

male 

female 

total  male 

female 

total 

N 

7 

10 

17 

2 

3 

5 

9 

13 

22 

GI 

9.29 

7.6 

8.29 

9.0 

5-33 

6.80 

9.22 

7.08 

7.96 

1.38 

1.78 

1.80 

4.24 

2.52 

3.42 

1.92 

2.10 

2.26 

WK 

23.86 

16.60 

19.59 

20.00 

11.33 

14.80 

23.00 

15.39 

18.50 

3-89 

5-99 

6.28 

5.66 

8.09 

7.95 

4.27 

6.56 

6.80 

MK 

14.71 

11.50 

12.82 

11.00 

9-33 

10.00 

13.89 

11.00 

12.18 

4.11 

2.76 

3.64 

4.24 

6.11 

4.90 

4.20 

3.58 

4.02 

GS 

13.00 

8.60 

10.41 

8.00 

7.33 

7.60 

11.89 

8.31 

9-77 

3-37 

4.20 

4.37 

4.24 

3-22 

3.13 

3-95 

3-90 

4.23 

NO 

36.43 

36.90 

36.71 

36.50 

37.00 

36.80 

36  .44 

36.92 

36.73 

9.29 

8.65 

8.63 

6.36 

14.93 

11.03 

8.35 

9.66 

8.94 

AR 

13.43 

11.10 

12.06 

9.50 

9.67 

9.60 

12.56 

10.77 

11.50 

3.87 

2.69 

303 

3.54 

3-51 

3.05 

3-97 

2.80 

3.36 

El 

19.00 

10.90 

14.24 

13.50 

14.00 

13.80 

17.78 

11.62 

14.14 

8.08 

5-22 

7.53 

0.71 

2.65 

1.92 

7.41 

4.84 

6.63 

SI 

13.71 

7.70 

10.18 

11.00 

6.00 

8.00 

i3.ll 

7.31 

9.68 

4.86 

2.31 

4.60 

1.41 

2.65 

309 

4.40 

2.39 

4.38 

AD 

13.43 

13.40 

13-41 

14.00 

12.33 

13-00 

13.56 

13.15 

13-32 

4.86 

3-47 

3-95 

5.66 

4.73 

4.47 

4.67 

3.60 

3-97 

SP 

13.57 

10.50 

11.77 

9.50 

9.00 

9.20 

12.67 

10.15 

11.18 

5.91 

301 

4.66 

0.71 

6.08 

4.32 

5.43 

3.85 

4.62 

MC 

11.86 

6.90 

8.94 

7.50 

6.00 

6.60 

10.89 

6.69 

8.41 

4.56 

2.08 

4.07 

0.71 

1.00 

1.14 

4.40 

1.89 

3-73 

AI 

9.00 

5-70 

7.06 

10.00 

5.33 

7.20 

9.22 

5.62 

7.09 

4.08 

2.58 

3.58 

1.41 

1.53 

2.86 

3.60 

2.33 

3-37 

AFQT 

50.86 

38.20 

43.41 

39.00 

30.00 

33-60 

48.22 

36.31 

41.18 

11.12 

7-77 

11.03 

9.90 

16.52 

13.61 

11.51 

10.19 

12.07 

IPR 

479.00 

660.00 

585.47  428.50 

348.67 

380.60 

467.78  588.15  538.91 

195.57 

251.06 

241.31 

109.60 

104.29 

101.75 

175-17 

260.24  232.51 

Note 

. Top  number  : 

Ls  test 

mean. 

Bottom 

number 

is  test  standard 

deviation. 
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TABLE  6 

MEAN  TEST  SCORES  BY  RACE  AND  SEX- 
PACIFIC  GROVE  HIGH  SCHOOL 


WHITE 


NONWHITE 


TOTAL 


male 

female 

total 

male 

female 

total 

male 

female 

total 

10 

14 

24 

7 

3 

10 

17 

17 

34 

11.00 

1.56 

8.14 

1.96 

9.33 

2.28 

10.43 

3.10 

7.33 

2.52 

9.50 

3-17 

10.77 

2.25 

8.00 

2.00 

9.38 

2.52 

20.50 

5.54 

19-00 

5.67 

19.63 

5.55 

18.71 

8.02 

17.33 

9-71 

I8.30 

8.02 

19.77 

6.50 

18.71 

6.19 

19.24 

6.27 

0 

• • 

r-l 

14.86 

4.64 

14.63 

4.47 

10.57 

5.26 

11.00 

6.93 

10.70 

5-40 

12.77 

5.00 

14.18 

5-08 

13.47 

5.02 

11.40 

4.45 

9.00 

2.25 

10.00 

3.48 

9.43 

3.05 

8.33 

4.73 

9.10 

3.38 

10.59 

3-95 

8.88 

2.64 

9.74 

3.42 

36.60 

7.46 

35.50 

7.52 

35.96 

7-35 

28.57 

9-93 

27.00 

11.79 

28.10 

9.86 

33-29 

9.21 

34.00 

8.63 

33.65 

8.80 

13.50 

4.70 

12.71 

3.43 

13.04 

3-93 

10.00 

4.80 

10.67 

4.04 

10.20 

4.37 

12.06 

4.92 

12.35 

3-50 

12.21 

4.21 

16.00 

3.68 

13.71 

2.76 

14.67 

301 

15.00 

4.40 

14.00 

1.73 

14.70 

3-71 

15.59 

3-89 

13.77 

2.56 

14.68 

3-37 

12.30 

2.95 

10.21 

2.16 

11.08 

2.67 

12.00 

4.76 

7.67 

3.22 

10.70 

4.67 

12.18 

3.66 

9.77 

2.46 

10.97 

3.31 

13.90 

3.28 

15.43 

2.41 

14.79 

2.84 

14.86 

2.97 

14.00 

6.25 

14.60 

3.84 

14.29 

3.10 

15-18 

3.15 

14.74 

3-11 

11.60 

5.19 

11.00 

4.17 

11.25 

4.52 

8.86 

3.67 

9-00 

4.58 

8.90 

3.70 

10  .47 
4.71 

IO.65 

4.17 

IO.56 

4.38 

12.00 

3-02 

9.36 

304 

10.46 

3-41 

10.57 

2.94 

6.00 

1.00 

9.20 

3-29 

11.41 

2.98 

8.77 

3.31 

10.09 

308 

9.60 

5.58 

8.64 

2.74 

9.04 

4.08 

8.71 

3.04 

7.33 

2.89 

8.30 

2.91 

9.24 

4.60 

8.41 

2.72 

8.82 

3-75 

45.60 

11.59 

42.71 

11.13 

43.92 

11.16 

37.57 

14.71 

37.00 

17.52 

37.40 

14.58 

42.29 

13.16 

41.71 

12.00 

42.00 

12.41 

785.10 

204.15 


651.21 

295.96 


707.00 

265.26 


643.14 
160 . 94 


457.67 

99.71 


587.50 

165.85 


726.65 

195-82 


617.06 

279.63 


671.85 

244.12 


Note.  Top  number  is  test  mean, 
deviation. 


Bottom  number  is  test  standard 


TABLE  7 

MEAN  TEST  SCORES  BY  RACE  AND  SEX  - AUSAL  HIGH  SCHOOL 


WHITE 

NONWHITE 

TOTAL 

male 

female 

total 

male 

female 

total 

male 

female 

total 

N 

0 

2 

2 

2 

5 

7 

2 

7 

9 

GI 

7.00 

1.41 

7-00 

1.41 

10.00 

2.83 

6 . 60 
1.82 

7.51 

2.51 

10.00 

2.83 

6.71 

1.60 

7.44 

2.24 

WK 

16.50 

2.12 

16.50 

2.12 

10.00 

2.83 

12.40 

4.39 

11.71 

3.95 

10.00 

2.83 

13.57 

4.20 

12.78 

4.09 

MK 

9.50 

4.95 

9.50 

4.95 

10.50 

3.54 

13.60 

4.61 

12.71 

4.31 

10.50 

3.54 

12.43 

4.72 

12.00 

4.36 

GS 

10.00 

1.41 

10.00 

1.41 

8.00 

0 

5.40 

1.52 

6.14 

1.77 

8.00 

0 

6.71 

2.63 

7.00 

2.35 

NO 

41.50 

12.02 

41.50 

12.02 

39.00 

9.90 

41.00 

7.14 

40.43 

7.16 

39.00 

9.90 

41.14 

7.62 

40.67 

7.53 

A R 

11.00 

4.24 

11.00 

4.24 

14.00 

0 

9.80 

3-27 

11.00 

3-37 

14.00 

0 

10..  14 
3.24 

11.00 

3-28 

El 

9.50 

3-54 

9-50 

3.54 

14.50 

9-19 

11.40 

3.21 

12.29 

4.82 

14.50 

9.19 

10.86 

3.13 

11.67 

4.53 

SI 

6.00 

0 

6.00 

0 

11.00 

1.41 

7.00 

1.41 

8.14 

2.34 

11.00 

1.41 

6.7 1 
1.25 

7-67 

2.24 

AD 

16.50 

4.95 

16.50 

4.95 

14.50 

4.95 

13.00 

5.05 

13.43 

4.65 

14.50 

4.95 

14.00 

4.90 

14.11 

4.60 

SP 

8.00 

0 

8.00 

0 

6.00 

4.24 

9.00 

4.36 

8.14 

4.22 

6.00 

4.24 

8.71 

3-59 

8.11 
3 .66 

MC 

8.50 

3.54 

8.50 

3.54 

9-50 

2.12 

5.60 

1.52 

6.71 

2.43 

9.50 

2.12 

6.43 

2.37 

7.11 

2.57 

AI 

4.00 

0 

4.00 

0 

8.50 

4.95 

5.20 

2.28 

6.14 

3-19 

8.50 

4.95 

4.86 

1.95 

5-6  7 
2.92 

AFQT 

35.50 

6.36 

35.50 

6.36 

30.00 

7.07 

31.20 

8.35 

30.86 

7.43 

30.00 

7-07 

32.43 

7.59 

31.89 

7.11 

IPR 

565.50 

207.18 

565.50 

207.18 

547.00 

264.46 

506 . 20 
189.86 

517.86 

189-97 

547.00 

264.46 

523.14 

178.96 

528.44 

181.31 

Note.  Top  number  is  test  mean.  Bottom  number  is  test  standard 
deviation. 
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Raw  Score 


Table  9.  B^th  groups  had  similar  nonsignificant  correlations 
for  the  AFQT,  and  the  only  significant  correlations  occurred 
on  the  General  Information  (GI)  and  General  Science  (GS)  sub- 
tests for  the  lower  50%  group.  The  relative  absence  of  sig- 
nificant correlations  is  due,  in  part,  to  the  smaller  sample 
sizes . 

The  sample  was  also  divided  to  determine  how  the  IPR  of 
whites  and  nonwhites  correlated  separately  with  the  ASVAB 
and  AFQT  scores  as  shown  in  Table  10.  Again  neither  correla- 
tion with  AFQT  was  significantly  different  from  zero , and 
significant  correlations  appeared  only  for  nonwhites  on  the 
GI,  GS , and  MC  subtests.  The  difference  between  groups  be- 
comes more  significant  in  this  instance  because  the  sample 
size  is  smaller  for  the  minority  group. 

When  males  and  females  were  considered  separately  (Table 
11) , none  of  the  male  correlations  reached  statistical  signifi- 
cance at  the  .05  level,  but  the  female  IPR  scores  correlated 
significantly  with  three  ASVAB  subtests  and  the  AFQT  and 
approached  significance  (p  £ .06)  on  two  additional  subtests. 

In  an  attempt  to  investigate  the  difference  in  mean  IPR 
and  AFQT  scores  between  these  various  groups,  a series  of 
significance  tests  of  the  difference  of  two  means  was  carried 
out  using  Student's  t test  for  independent  samples.  Means  for 
IPR  and  AFQT  were  compared  between  white  and  nonwhite,  male 
and  female,  and  high  AFQT  50%  and  low  AFQT  50% • The  results 
are  shown  in  Table  12.  Significant  differences  on  the  IPR 
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TABLE  12 


were  found  only  for  the  ethnic  comparison,  but  the  AFQT  was 
significantly  different  for  the  AFQT  grouping  and  the  ethnic 
comparison.  The  former  is  trivial,  of  course,  since  the 
groups  were  divided  on  AFQT. 

Analysis  was  continued  to  examine  the  relationship  of 

the  paper-and-pencil  test  variables  with  the  Arima-Young 

test  while  controlling  for  sex,  ethnic  affiliation  and  school. 

Multiple  regression  with  forward  stepwise  inclusion  was  used 

to  determine  which  variables  were  significant  predictors  of 

the  dependent  variable  IPR  score.  The  stepwise  inclusion 

2 

process  was  stopped  when  the  adjusted  r was  a maximum. 

Dummy  variables  were  assigned  to  represent  ethnic  (white  - 
nonwhite),  sex,  and  school  variables.  Since  GI  had  been  most 
highly  correlated  in  earlier  analysis,  interaction  variables 
were  created  using  GI  and  the  demographic  variables  to  see  if 
GI  predicted  equally  for  all  groups.  AFQT  was  also  used  in 
creating  interactive  variables  because  of  its  variance  between 
groups  and  because  of  its  importance  in  the  service  selection 
process.  Table  13  lists  the  variables  used  in  the  regression 
analysis.  Results  of  the  ensuing  regression  appear  in  Table  14. 

It  can  be  seen  that  only  two  variables  are  of  significant 
use  in  predicting  IPR  scores.  GI  serves  as  a predictor  for 
all  subjects.  AFQT  is  an  aid  in  predicting  IPR  only  for  whites. 

There  is  a possibility  that  scoring  the  learning  test 


might  be  improved  by  weighting  accuracy  more  heavily.  The 
computed  IPR  was  a simple  division  of  correct  answers  by 


TABLE  13 
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VARIABLES  USED  IN  MULTIPLE  REGRESSION 


GI 

Score  on  GI  Subtest 

El 

Score  on  El  Subtest 

WK 

Score  on  WK  Subtest 

MK 

Score  on  MK  Subtest 

GS 

Score  on  GS  Subtest 

NO 

Score  on  NO  Subtest 

AR 

Score  on  AR  Subtest 

SI 

Score  on  SI  Subtest 

AD 

Score  of  AD  Subtest 

SP 

Score  on  SP  Subtest 

MC 

Score  on  MC  Subtest 

AX 

Score  on  AI  Subtest 

AFQT 

Composite  Score 

D1 

1 if  Seaside  High  School;  0 otherwise 

D2 

1 if  Alisal  High  School;  0 otherwise 

D3 

1 if  white;  0 otherwise 

D4 

1 if  Male  s 0 if  female 

GID1 

GI  x D1 

GID2 

GI  x D2 

GID3 

GI  x D3 

GID4 

GI  x D4 

AFD1 

AFQT  x D1 

AFD2 

AFQT  x D2 

AFD3 

AFQT  x D3 

AFD4 

AFQT  x D4 

i 


TABLE  14 


REGRESSION  OF  IPR  ON  MULTIPLE  VARIABLES 


Multiple  r = .489  r2  = .239 

Adjusted  r2=  .189 
Standard  error  = 215*550 


Analysis  of  Variance 

df 

SS 

MS  F 

Sig 

Regression 

4 

877744.55 

219436.14  4.7 

.005 

Residual 

60 

2787705.45 

46461.76 

Variables  in  Equation 

B 

Beta 

Std  Error  B 

F 

Sig 

GI 

32.22 

.34 

II.83 

7.42 

.01 

AFD3 

3.48 

*33 

1.28 

7-33 

.01 

GID1 

-12.49 

-.21 

6.91 

3*27 

N.S . 

El 

-8.67 

-.18 

6 . 16 

1.98 

N.S. 

constant  384.96 


Note.  Wilkinson  (1979)  discusses  some  of  the  weaknesses  in 

determining  the  significance  of  r2  in  stepwise  regres- 
sion. In  his  tables,  determined  by  Monte  Carlo  Sim- 
ulation of  the  distribution  of  r2,  the  above  results 
would  not  be  significant  at  the  . 0 5 level  unless  r2 
was  above  . 31 . 
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' 


; 


i 
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total  time  (multiplied  by  1000  for  manipulation) . Although 

it  was  not  obvious  in  the  experiment,  a subject  could  score 

a high  IPR  simply  by  going  very  quickly  and  guessing  exten-  j 

sively.  A simple  means  of  alleviating  this  potential  problem 

is  to  introduce  a penalty  into  the  scoring.  Subtracting  the 

number  missed  from  the  number  correct  prior  to  dividing  by  j 

total  time  would  achieve  the  desired  result.  This  formula 

was  computed  for  the  subjects  involved  in  the  experiment  and 

found  to  be  fairly  consistent  with  IPR  scores  computed  the 

original  way.  The  two  scoring  methods’  Pearson  product  moment 

correlation  coefficient  equalled  .89,  which  exceeds  the  .01 

level  of  significance.  From  Table  15  it  can  be  seen  that 

accuracy  becomes  more  important  in  the  second  scoring  method 

and  time  ceases  to  have  a significant  effect  on  score.  It 

can  also  be  seen  that  the  people  who  made  the  highest  number  [ 

of  correct  responses  tended  to  be  the  ones  who  also  worked 

the  fastest. 

A further  technique  in  scoring  may  have  been  a still  better 
discriminator  of  ability  in  this  test.  As  was  attempted  in 
the  early  stages  of  the  experiment,  a criterion  of  a certain 
number  of  correct  responses  in  a row  could  be  used  to  determine 
test  completion.  A score  could  then  be  derived  from  either 
time-to-completion  or  total  frames  exposed  or  some  combination 
of  the  two.  A major  advantage  of  this  scoring  method  would 
be  that  the  quick  learner  would  not  be  penalized  by  having  to 
process  ten  (or  however  many)  trials.  There  would  have  to 


. 1.  .... .... 
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TABLE  1 5 


CORRELATIONS  OF  TEST  SCORES  WITH  TIME  AND  NUMBER  CORRECT 


ADJSCR 


No.  Correct 


Time 


IPR 

ADJSCR 

No.  Correct 


89 


• 37** 
.58** 


-.26* 
-.19 
- . 73** 


Notes.  1.)  IPR  = Information  Processing  Rate  = 


No . Correct 
Time 


2. )  ABJSCR  = Adjusted  Score  * it,c.  Correct., Wo  ■ Incorrect 

3. )  *Significant  at  .05  level. 

♦♦Significant  at  .01  level. 
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be  some  upper  limit  of  time  or  frames  to  preclude  test 
marathons , however . 

Internal  reliability  of  the  test  was  measured  using  a 
split-halves  technique  described  by  Bruning  and  Klintz  (1968) . 
Information  had  been  taken  on  35  of  the  subjects  as  to  the 
order  of  correct  and  incorrect  responses.  Odd  and  even  frames 
were  split  and  Pearson’s  product-moment  correlation  was  cal- 
culated between  the  two  halves.  After  correction  to  reflect 
actual  test  length,  a reliability  coefficient  of  .7 7 was 
obtained.  This  compares  well  with  a reliability  coefficient 
of  .84  obtained  by  Young  using  a similar  method. 


I 
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VIII.  DISCUSSION  OF  RESULTS 

It  is  apparent  that  the  Arima-Young  test  does  indeed 
measure  some  ability  other  than  that  measured  by  the  more 
traditional  ASVAB.  The  correlation  of  .26  between  the  IPR  and 
AFQT  scores  shows  that  there  is  very  little  similarity  between 
the  two . 

An  individual's  AFQT  score ( which  at  present  determines 
mental  qualification  for  military  service,  is  more  an  indica- 
tion of  knowledge,  facts,  and  processes  absorbed  by  him  or  her 
up  to  the  time  of  test  administration.  No  matter  what  cogni- 
tive processes  are  involved  during  the  test  itself,  learned 
skills  are  required  to  perform  well.  If  an  individual,  for 
any  reason,  does  not  have  these  skills,  he  or  she  will  not 
score  well  on  the  ASVAB. 

The  Arima-Young  test  apparently  measures  a more  basic 
quality.  Learning  was  shown  to  have  taken  place  in  the 
original  project  (Young,  1975?  Arima,  1978),  and  an  individual's 
ability  to  learn  could  be  differentiated  by  an  information 
processing  rate  (IPR) . Their  test  appears  to  involve  more 
specifically  cognitive  factors  such  as  Memory  Span,  Visual 
Memory,  and  Perceptual  Speed  which  are  discussed  by  Harman 
(1975)*  The  low  correlations  in  Table  8 between  IPR  and  ASVAB 
subtests  indicate  a difference  in  quality  measured. 

Since  the  AFQT  score  was  only  weakly  associated  with  the 
IPR  score,  there  should  then  be  some  individuals  who  would 
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not  have  been  accepted  into  the  military  based  on  ASVAB  re- 
sults who  nevertheless  displayed  above-average  learning 
ability  as  measured  by  the  Arima-Young  test.  In  fact  there 
were  four  subjects  in  the  present  sample  whose  raw  AFQT  scores 
were  below  the  required  mark  of  28  (for  the  Navy)  but  who, 
nevertheless,  received  above-average  IPR  scores  of  607.  Three 
of  these  individuals  were  nonwhite. 

It  can  also  be  seen  (Figure  4)  that  there  was  another 
group  which  did  quite  well  on  the  AFQT  yet  scored  below 
average  on  the  IPR.  It  is  hypothesized  that  these  may  rep- 
resent people  who  make  up  for  a lack  of  superior  ability  by 
hard  studying  — the  so-called  overachievers.  Hard  study 
may  be  effective  for  paper-and-pencil  tests  covering  tradition- 
al subjects;  the  Arima-Young  test  does  not  reward  such 
behavior . 

The  point  is  not  that  the  Arima-Young  test  is  better  than 
the  ASVAB  for  service  selection  purposes,  only  that  it  may 
measure  a more  specific,  basic  ability  which  is  less  depend- 
ent on  previous  schooling.  There  must  obviously  still  be 
some  sort  of  selection  device  such  as  the  ASVAB. 

The  military  carries  on  much  of  its  instruction  for 
various  specialties  in  traditional  classroom  environments  with 
success  heavily  dependent  on  previously  developed  classroom 
skills.  It  would  not  be  practical  for  the  services  to  attempt 
to  bring  all  personnel  up  to  the  point  of  verbal  and  mathemati- 
cal expertise  which  are  assumed  to  be  held  by  students  entering 
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some  of  these  schools.  Therefore  some  form  of  testing  is  re- 
quired to  select  those  who  have  achieved  the  skills  necessary 
to  do  well  in  continued  classroom  training.  The  ASVAB  is  of 
some  use  in  this  respect  (Fischl,  et  al.  1977). 

A test  of  learning  ability  becomes  important  for  those  jobs 
where  less  traditional  forms  of  instruction  are  or  could  be 
used,  those  jobs  where  one  learns  by  watching  and  doing.  On- 
the-job  training  requires  less  of  the  formal,  school  skills 
while  still  requiring  an  ability  to  absorb  new  knowledge. 

A test  such  as  the  Arima-Young  test  could  identify,  from 
those  doing  poorly  on  the  ASVAB,  the  people  who  still  have 
high  potential  to  learn  on  the  job  if  accepted  into  the  mili- 
tary. Its  potential  is  as  an  auxiliary  selection  tool  in  a 
time  of  restricted  supply. 

There  are  certainly  many  other  measures  which  could  be 
taken  to  insure  an  adequate  enlistment  rate  into  the  armed 
forces.  Increasing  retention,  lowering  physical  standards, 
admitting  more  women,  becoming  more  capital  intensive,  or 
reinstituting  the  draft  would  all  accomplish  the  same  thing, 
all  with  a different  group  of  people.  It  is  not  the  goal 
here  to  identify  the  best  alternatives. 

For  the  present  sample  the  Arima-Young  test  was  not  culture 
free  in  the  sense  that  whites  and  nonwhites  did  not  score 
equally.  Regressions  analysis  brought  out  that  there  were 
some  differences  in  predictors  between  the  two  ethnic  groups 
and  that  the  AFQT  was  a predictor  of  the  IPR  score  for  only 
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the  white  subjects.  This  result,  which  is  consistent  with 
the  results  of  the  original  study  (Arima,  1978;  Young,  1975) » 
permits  the  use  of  a single  equation  to  predict  the  IPR  scores 
of  both  groups  and  is  culture-free  in  this  respect. 

It  may  be  that  it  is  impossible  to  devise  a test  of  any 
kind  useful  for  service  selection  purposes  that  does  not  have 
an  adverse  cultural  impact.  Why  performance  on  the  present 
test  was  not  equal  for  both  ethnic  groups  can  only  be  conjec- 
tured. The  test  was  still  a test,  and  those  more  experienced 
with  tests  may  still  have  an  advantage.  Perceptions  of  author- 
ity, if  different  for  each  group,  may  have  an  affect  on  the 
test  results.  The  race  of  the  test  administrator  and  the 
test  environment  may  be  influential  factors.  The  language 
of  the  instructions,  the  words  and  phrases  used,  could  give 
an  edge  to  one  group.  If  there  is  indeed  an  ethnic  difference 
in  IPR  scores  for  the  Arima-Young  test  that  transcends  the 
present  sample  for  high  school  students,  it  would  be  interest- 
ing to  follow  age  groups  down  through  the  primary  grades  to 
determine  if  this  difference  is  inherent  or  if  it  is  in  some 
way  developed  or  learned. 

In  order  to  judge  its  final  potential,  the  Arima-Young 
test  must  first  be  submitted  to  a test  of  its  predictive 
validity  against  some  final  criterion.  The  immediate  problem 
of  course  is  that  of  finding  any  reliable  and  valid  criterion 
of  job  performance.  Supervisor  ratings  could  immediately 
bring  race  back  as  an  issue;  there  would  be  no  final  school 
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grades,  and  grades  are  not  really  a measure  of  performance 
on  the  jobs  advancement  is  based  on  paper-and- pencil  tests, 
something  the  subject  has  already  proven  he  does  poorly.  Per- 
haps one  must  settle  as  Lockman  (1975)  did  on  a simple  measure 
of  survival.  A person  is  successful  if  he  or  she  is  still  on 
the  job  some  time  in  the  future.  If,  somehow,  the  test  proves 
to  be  a valid  predictor  of  job  success,  then  any  ethnic  dif- 
ference in  scores  could  still  be  the  basis  for  adverse  impact 
if  it  is  used  indiscriminately  for  selection  purposes. 


IX.  CONCLUSIONS 


The  following  conclusions  may  be  drawn i 

1.  The  Arima-Young  test  can  in  fact  identify,  from  those 
who  score  poorly  on  the  AFQT,  a group  of  individuals  who  have 
high  learning  ability,  at  least  within  the  narrow  parameters 
of  the  present  test. 

2.  Whites  scored  higher  than  nonwhites  on  the  IPR  test. 

A final  determination  of  culture-fairness  must  await  valida- 
tion of  test  results  with  some  measure  of  success  on  the  job 
as  a criterion. 

3*  Study  of  ethnic  results  for  various  age  groups  starting 
at  an  early  age  may  give  further  insight  as  to  the  reasons  for 
ethnic  differences  in  score  at  the  high  school  level. 

4.  A more  useful  score  may  be  derived  by  subtracting  in- 
correct answers  from  correct  ones  prior  to  computing  an  Infor- 
mation Processing  Rate  (IPR). 


I 


1 


56 


APPENDIX  A 


TEST  INSTRUCTIONS 

The  test  you  will  soon  see  takes  less  than  five  minutes. 
There  is  no  passing  or  failing  score  because  the  test  is  being 
used  for  research  purposes  only. 

You  will  be  seated  in  front  of  a machine  which  will  show 
you  two  figures  at  a time.  The  figures  were  drawn  by  computer 
and  are  not  supposed  to  represent  any  physical  objects.  For 
each  pair  of  figures  I have  picked  one  to  be  the  right  answer 
and  one  to  be  the  wrong  answer.  You  will  be  shown  a number  of 
pairs  of  these  figures.  For  each  pair  I would  like  you  to 
try  to  pick  the  correct  figure  by  pressing  the  clear  panel 
directly  over  that  figure.  If  the  machine  moves  to  the  next 
pair,  you  know  you  have  picked  the  correct  figure.  If  nothing 
happens,  you  should  pick  the  other  figure  by  pressing  the  clear 
panel  over  it.  Pushing  both  panels  at  the  same  time  causes 
an  error. 

You  will  see  the  same  pairs  of  figures  many  times.  The 
first  time  you  see  a pair  you  can  only  guess  which  of  the  two 
figures  I have  chosen  as  correct.  After  you  have  guessed,  try 
to  remember  which  figure  was  correct  so  that  you  can  pick  that 
one  the  next  time  you  see  it.  The  same  two  figures  will  al- 
ways be  together,  and  the  correct  figure  will  remain  the 
same.  At  times  I have  flip-flopped  the  pairs  so  that  the 
correct  answer  may  be  on  either  thB  left  or  the  right  side. 


I 


57 


I 


■ 

Although  a timer  is  being  used,  this  is  not  a timed  test. 
Just  work  as  quickly  as  you  can  without  rushing.  Prom  time 
to  time  a buzzer  may  sound.  Ignore  itj  it  is  simply  built 
into  the  system  and  does  not  affect  you. 

! 

I Here  is  an  example  of  what  you  will  see» 


. 

' 


APPENDIX  B 


INDIVIDUAL  IPR  AND  APQT  SCORES 


SUBJECT 

ETHNIC 

IPR 

APQT 

1 

W 

588 

49 

2 

w 

840 

65 

3 

w 

882 

46 

4 

w 

534 

45 

5 

w 

475 

41 

6 

w 

223 

45 

7 

w 

506 

46 

8 

w 

446 

22 

9 

w 

400 

42 

10 

N 

261 

22 

11 

w 

833 

40 

12 

N 

321 

19 

13 

W 

1090 

30 

14 

w 

469 

43 

15 

w 

466 

39 

16 

N 

351 

32 

17 

w 

957 

44 

18 

w 

5U 

62 

19 

w 

379 

58 

20 

w 

412 

35 

21. 

N 

464 

49 

22 

W 

448 

32 

23 

w 

764 

52 

24 

w 

779 

24 

25 

N 

583 

22 

26 

w 

829 

57 

27 

w 

584 

29 

28 

w 

1020 

40 

29 

w 

318 

54 

30 

N 

431 

55 

31 

N 

568 

36 

32 

w 

644 

44 

33 

w 

398 

42 

34 

w 

652 

48 

35 

N 

442 

43 

36 

w 

492 

43 

37 

N 

706 

26 

38 

N 

822 

63 

39 

W 

988 

52 

40 

N 

844 

40 

41 

W 

365 

28 

42 

N 

456 

24 
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